Joint Modelling of Confounding Factors and Prominent Genetic Regulators Provides Increased Accuracy in Genetical Genomics Studies

Authors

  • Nicoló Fusi
  • Oliver Stegle
  • Neil D. Lawrence
Abstract

Expression quantitative trait loci (eQTL) studies are an integral tool for investigating the genetic component of gene expression variation. A major challenge in the analysis of such studies is the presence of hidden confounding factors, such as unobserved covariates or unknown subtle environmental perturbations. These factors can induce a pronounced artifactual correlation structure in the expression profiles, which may create spurious associations or mask real genetic association signals. Here, we report PANAMA (Probabilistic ANAlysis of genoMic dAta), a novel probabilistic model to account for confounding factors within an eQTL analysis. In contrast to previous methods, PANAMA learns hidden factors jointly with the effect of prominent genetic regulators. As a result, this new model can more accurately distinguish true genetic association signals from confounding variation. We applied our model and compared it to existing methods on different datasets and biological systems. PANAMA consistently performs better than alternative methods and, in particular, finds substantially more trans regulators. Importantly, our approach not only identifies a greater number of associations, but also yields hits that are biologically more plausible and can be better reproduced between independent studies. A software implementation of PANAMA is freely available online at http://ml.sheffield.ac.uk/qtl/.
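The effect described in the abstract, where a hidden factor induces correlation across expression profiles and masks a real association, can be illustrated with a small simulation. The sketch below is not PANAMA and does not use its software: it removes the first principal component as a deliberately simple stand-in for confounder correction, and every function name, effect size, and sample size in it is a hypothetical choice made for illustration only.

```python
import numpy as np


def simulate(n_samples=500, n_genes=40, seed=0):
    """Simulate one SNP, one hidden confounder, and a gene expression matrix.

    All sizes and effect strengths are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    snp = rng.integers(0, 3, size=n_samples).astype(float)  # genotypes 0/1/2
    hidden = rng.normal(size=n_samples)                      # unobserved confounder
    loadings = rng.uniform(1.5, 2.5, size=n_genes)           # confounder affects every gene
    expr = np.outer(hidden, loadings) + rng.normal(size=(n_samples, n_genes))
    expr[:, 0] += 1.0 * snp                                  # true genetic effect on gene 0 only
    return snp, expr


def mean_abs_offdiag_corr(expr):
    """Average absolute gene-gene correlation, excluding the diagonal."""
    corr = np.corrcoef(expr, rowvar=False)
    off_diag = ~np.eye(corr.shape[0], dtype=bool)
    return np.abs(corr[off_diag]).mean()


def remove_top_pc(expr):
    """Subtract the rank-1 approximation given by the first principal component.

    This is a simple stand-in for confounder correction, not the PANAMA model.
    """
    centered = expr - expr.mean(axis=0)
    u, s, vt = np.linalg.svd(centered, full_matrices=False)
    return centered - np.outer(u[:, 0] * s[0], vt[0])


def abs_corr(x, y):
    """Absolute Pearson correlation between two vectors."""
    return abs(np.corrcoef(x, y)[0, 1])


snp, expr = simulate()
corrected = remove_top_pc(expr)

print("mean |gene-gene corr| before:", mean_abs_offdiag_corr(expr))
print("mean |gene-gene corr| after: ", mean_abs_offdiag_corr(corrected))
print("|corr(SNP, gene 0)| before:  ", abs_corr(snp, expr[:, 0]))
print("|corr(SNP, gene 0)| after:   ", abs_corr(snp, corrected[:, 0]))
```

In this toy setting the confounder is independent of the genotype, so removing the leading component is harmless. When a strong genetic regulator drives many genes, a principal component may absorb part of the genetic signal, which is the situation the abstract's joint modelling of hidden factors and prominent regulators is meant to address.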

Similar resources

Mapping Determinants of Gene Expression Plasticity by Genetical Genomics in C. elegans

Recent genetical genomics studies have provided intimate views on gene regulatory networks. Gene expression variations between genetically different individuals have been mapped to the causal regulatory regions, termed expression quantitative trait loci. Whether the environment-induced plastic response of gene expression also shows heritable difference has not yet been studied. Here we show tha...


A review of microarray experimental design strategies for genetical genomics studies

Genetical genomics approaches provide a powerful tool for studying the genetic mechanisms governing variation in complex traits. By combining information on phenotypic traits, pedigree structure, molecular markers and gene expression, such studies can be used for estimating heritability of mRNA transcript abundances, for mapping expression quantitative trait loci (eQTL), and for inferring regul...


Review of microarray experimental design strategies for genetical genomics studies.

Genetical genomics approaches provide a powerful tool for studying the genetic mechanisms governing variation in complex traits. By combining information on phenotypic traits, pedigree structure, molecular markers, and gene expression, such studies can be used for estimating heritability of mRNA transcript abundances, for mapping expression quantitative trait loci (eQTL), and for inferring regu...


Review of microarray experimental design strategies for genetical genomics studies

Rosa GM, deLeon N, Rosa AJM. Review of microarray experimental design strategies for genetical genomics studies. Physiol Genomics 28: 15–23, 2006. First published September 19, 2006; doi:10.1152/physiolgenomics.00106.2006.—Genetical genomics approaches provide a powerful tool for studying the genetic mechanisms governing variation in complex traits. By combining information on phenotypic traits...


Covariate-Adjusted Precision Matrix Estimation with an Application in Genetical Genomics.

Motivated by analysis of genetical genomics data, we introduce a sparse high dimensional multivariate regression model for studying conditional independence relationships among a set of genes adjusting for possible genetic effects. The precision matrix in the model specifies a covariate-adjusted Gaussian graph, which presents the conditional dependence structure of gene expression after the con...



Journal title:

Volume 8, Issue

Pages -

Publication date: 2012